首页> 外文OA文献 >Deep neural networks for i-vector language identification of short utterances in cars

【2h】

Deep neural networks for i-vector language identification of short utterances in cars

机译：用于i-vector语言识别汽车短话语的深度神经网络

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper is focused on the application of the Language Identification (LID) technology for intelligent vehicles. We cope with short sentences or words spoken in moving cars in four languages: English, Spanish, German, and Finnish. As the response time of the LID system is crucial for user acceptance in this particular task, speech signals of different durations with total average of 3.8s are analyzed. In this paper, the authors propose the use of Deep Neural Networks (DNN) to model effectively the i-vector space of languages. Both raw i-vectors and session variability compensated i-vectors are evaluated as input vectors to DNNs. The performance of the proposed DNN architecture is compared with both conventional GMM-UBM and i-vector/LDA systems considering the effect of durations of signals. It is shown that the signals with durations between 2 and 3s meet the requirements of this\udapplication, i.e., high accuracy and fast decision, in which the proposed DNN architecture outperforms GMM-UBM and i-vector/LDA systems by 37% and 28%, respectively.

机译：本文重点介绍语言识别（LID）技术在智能汽车中的应用。我们以四种语言处理英语中的简短句子或单词，这些语言是英语，西班牙语，德语和芬兰语。由于LID系统的响应时间对于用户在此特定任务中的接受度至关重要，因此分析了不同持续时间的语音信号，其平均时间为3.8s。在本文中，作者提出使用深度神经网络（DNN）有效地建模语言的i向量空间。原始i向量和经过会话可变性补偿的i向量都被评估为DNN的输入向量。考虑到信号持续时间的影响，将提出的DNN体系结构的性能与常规GMM-UBM和i-vector / LDA系统进行了比较。结果表明，持续时间在2到3s之间的信号满足该\ ud应用的要求，即高精度和快速决策，其中所提出的DNN架构在性能上优于GMM-UBM和i-vector / LDA系统37％和28 ％，分别。

著录项

作者
Ghahabi Esfahani, Omid; Bonafonte Cávez, Antonio; Hernando Pericás, Francisco Javier; Moreno Bilbao, M. Asunción;
展开▼
作者单位

展开▼
年度 2024
总页数
原文格式 PDF
正文语种 eng
中图分类

相似文献

外文文献
中文文献
专利

1. Frame-by-frame language identification in short utterances using deep neural networks [J] . Gonzalez-Dominguez Javier, Lopez-Moreno Ignacio, Moreno Pedro J., Neural Networks: The Official Journal of the International Neural Network Society . 2015,第Null期

机译：使用深度神经网络在短话语中逐帧识别语言
2. I-vector features and deep neural network modeling for language recognition [J] . Wei Wang, Wenjie Song, Chen Chen, Procedia Computer Science . 2019,第5期

机译：我 - 矢量特征和语言识别深神经网络建模
3. Regularization of neural network model with distance metric learning for i-vector based spoken language identification [J] . Xugang Lu, Peng Shen, Yu Tsao, Computer speech and language . 2017,第JULa期

机译：基于距离度量学习的神经网络模型正则化，用于基于i-vector的口语识别
4. Deep Neural Networks for i-Vector Language Identification of Short Utterances in Cars [C] . Omid Ghahabi, Antonio Bonafonte, Javier Hernando, Annual Conference of the International Speech Communication Association . 2016

机译：用于汽车简短话语的I-Vector语言识别深神经网络
5. Deep Neural Language Model for Text Classification Based on Convolutional and Recurrent Neural Networks [D] . Hassan, Abdalraouf. 2018

机译：基于卷积神经网络和递归神经网络的深度神经语言文本分类模型
6. Language Identification in Short Utterances Using Long Short-Term Memory (LSTM) Recurrent Neural Networks [O] . Ruben Zazo, Alicia Lozano-Diez, Javier Gonzalez-Dominguez, 2011

机译：使用长短期记忆（LSTM）递归神经网络的短话语语言识别
7. Frame-by-frame language identification in short utterances using deep neural networks [O] . González Domínguez, Javier, López-Moreno, Ignacio, Moreno, Pedro J., 2017

机译：使用深度神经网络在短语中识别逐帧语言

Deep neural networks for i-vector language identification of short utterances in cars

摘要

著录项

相似文献

相关主题

期刊订阅